CDS

Accession Number TCMCG075C24369
gbkey CDS
Protein Id XP_007019714.2
Location complement(join(8961199..8961347,8961444..8961553,8961644..8961769,8961906..8961993,8962118..8962375,8962515..8962711,8962805..8962869,8962982..8963230))
Gene LOC18592775
GeneID 18592775
Organism Theobroma cacao

Protein

Length 413aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007019652.2
Definition PREDICTED: probable beta-1,3-galactosyltransferase 8 isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K20855        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGCAGCGTCTTTATTATGCCTTGTGCCCGTGGCAGCTTGACAACGAGTCCAAGAAAATGCGGGGAAAGGCAGTTTCAGGGAAAGCCATTTTCGTATTATGTCTTGCTAGCTTTCTTGCAGGATCACTGTTTACCAGCCGAACGTGGACTGCTTACACTTCTCATGACAAATATCATCCAACTCCACCCATTCAAAAGCATGCCAGCAATAAGTTGGGAGAAGTAGCCCGTGACTTTGATCGCAAACGTAAATTGGCTGAAGGAAAGGCAGAAGATATCATGGGGGAAGTCTCAAAAACTCACAAGGCTATCCAGTCACTTGACAAAACAATCTCCAACTTGGAAATGGAATTAGCAGTAGCTCGTATGAGCAAGACTAGTGCTGGAGGAATTTCCCTGGAAAGCAAATCTAATCAGACATTGCAGAAGGCTTTTGTGGTTATTGGAATTAACACAGCATTTAGCAGCAGGAAAAGAAGAGACTCTGTTCGAGAAACATGGATGCCTAGAGGAGAAAAACTGAAGAAATTGGAGAGAGAGAAAGGGATTGTTATAAGGTTTGTGATAGGGCACAGTGCCACACCAGGGGGTGTTCTGGATAAAGCACTGGACAGAGAAGAGGCAGAGCACAAGGACTTTCTTAGGCTGAAACACGTAGAAGGATACCACCAGCTGTCCACCAAGACCAGACTCTATTTCTCTACTGCTGTTGCCATATGGGACGCCCAATTCTATGTGAAGGTGGATGATGATGTCCATCTCAACTTAGGTATGCTAGCCAGCACGCTTGCACAATACCGATCCAAGCCCAGAGTGTATATCGGATGCATGAAGTCTGGACCAGTTCTTTCTCGCAAAGGGGTGAAATATCACGAACCAGAGTACTGGAAATTTGGAGAGGATGGAAACAAGTACTTCAGGCATGCCACTGGACAATTATATGGCATCTCCAAGGACCTTGCTGCCTATATTTCCATCAACTCCCCCATCTTGCATAGATACGCCAATGAGGACGTGTCTCTGGGATCATGGTTGATTGGCTTAGAAGTTGAACACGTGGACGACCGTTCCATGTGCTGTGGGACCCCTCCAGATTGTGAATGGAAGGCTCAAGCAGGGAATATCTGCGTGGCCTCATTCGATTGGTCGTGCAGCGGAGTATGCAATTCAGTGGAGAGAATGAAATATGTGCATTCCTCTTGCGGAGAAGGAGATGGTGCACTTTGGAAGGTTGATCTTTGA
Protein:  
MQRLYYALCPWQLDNESKKMRGKAVSGKAIFVLCLASFLAGSLFTSRTWTAYTSHDKYHPTPPIQKHASNKLGEVARDFDRKRKLAEGKAEDIMGEVSKTHKAIQSLDKTISNLEMELAVARMSKTSAGGISLESKSNQTLQKAFVVIGINTAFSSRKRRDSVRETWMPRGEKLKKLEREKGIVIRFVIGHSATPGGVLDKALDREEAEHKDFLRLKHVEGYHQLSTKTRLYFSTAVAIWDAQFYVKVDDDVHLNLGMLASTLAQYRSKPRVYIGCMKSGPVLSRKGVKYHEPEYWKFGEDGNKYFRHATGQLYGISKDLAAYISINSPILHRYANEDVSLGSWLIGLEVEHVDDRSMCCGTPPDCEWKAQAGNICVASFDWSCSGVCNSVERMKYVHSSCGEGDGALWKVDL